Configuring: Random Values
Generate a dataset with a configured number of fields of each specified type and a defined number of rows. Random values will populate the dataset according to the field specifications.
Steps to configure
-
Field selection: Define the number of fields to be generated within the random dataset.
-
Field configuration: Define the boundaries for doubles, integers and strings.
-
Configure timestamps: Define the boundaries for timestamps. if you have opted to include a timestamp, you will define the timestamp boundaries in this window.
-
Execute: a dataset of random values is generated
Step 1: Field selection
Field Count:
Configure the number of data fields required, the type of field data and the number of timestamps required, if any. Continuous processes require a timestamp, whereas discrete and batch processes can be configured with or without a timestamp.
-
The data table generated will list the number or fields across the columns, corresponding to the numbers inserted in the field counts windows. The order of the columns generated will always be: doubles; integers; strings; timestamps.
-
The number of fields selected will influence the number of rows that can be generated, according to the limitations of a dataset.
-
Default: 1 double field will be generated.
-
Minimum configuration: at least 1 field is required to configure the dataset.
Field names:
The field names will be applied to every field. The example previews the name as will be given to the first field, hence the value 1 will be shown in the example.
-
A prefix is always required; a postfix is optional.
-
Any incorrect values, such as a character, will automatically be changed to the correct Troubleshooter format, using underscores. This is only evident when you click out of the box.
-
If the character selected is not previewed in the example, this character is not supported as a field name. This is only evident when you click out of the box.
-
Default: Prefix = Field, with no postfix value.
-
Minimum configuration: at prefix value.
Rows:
-
Define the number of rows of data to be generated.
-
The number of rows that can be generated is influenced by the number of fields configured, according to the limitations of a dataset.
-
Default: 100
-
Minimum configuration: at least one row of data.
Seed Value:
-
This value determines which random dataset will be generated.
-
Using the same seed value on each configuration will ensure that the same random values are generated for each dataset.
-
A seed value of 0 will randomly choose a seed value, so that a random dataset is generated.
-
Default: 0 - a random dataset will be generated.
-
Minimum configuration: a seed value must be defined.
Step 2: Field configuration
This is where the boundaries of double, integer and string fields are set. Timestamp fields, if selected, will be defined in the next window.
Double fields:
The number of double fields required will be included in the heading of the group box.
-
Set the boundary limits to the numbers that can be generated. These values are included into the potential value range.
-
Maximum and minimum: Only numbers can be entered, and these are only limited by the maximum and minimum values that are defined as a double value. 64 bit values are supported.
-
Minimum values must be smaller than maximum values.
-
Null probability: the maximum probability of a row being set to null. This is a percentage value.
-
Default: minimum = 0, maximum = 1, null probability = 0.
-
Minimum configuration: value limits need to be defined for each required field.
Integer fields:
The number of integer fields required will be included in the heading of the group box.
-
Set the boundary limits to the numbers that can be generated. These values are included into the potential value range.
-
Maximum and minimum: Only integer numbers can be entered, and 32 bit values are supported.
-
Null probability: the maximum probability of a row being set to null. This is a percentage value.
-
Default: minimum = 0, maximum = 100, null probability = 0.
-
Minimum configuration: value limits need to be defined for each required field.
String fields:
The field names will be applied to every field.
-
See the listed Expression Help for details on generating a string expression.
-
Null probability: the maximum probability of a row being set to null. This is a percentage value.
-
Default: expression = 8 random letters, null probability = 0.
-
Minimum configuration: at least one string character.
Step 3: Configure timestamps
Define the window from which the timestamp must be generated, as well as the sampling period to be used. The number of timestamp fields required will be included in the heading of the group box.
Window selection:
-
There are 3 different ways to define the window start and end values - select the appropriate radio button.
-
Start + End:
-
-
The duration of the interval selected will be shown in the duration window.
-
The end date of the interval can never be before the start date.
-
-
Start + Duration:
-
-
Set the duration of the interval selected in the duration window.
-
The end date of the interval will change automatically.
-
-
Duration + End:
-
-
Set the duration of the interval selected in the duration window.
-
The end date of the interval will remains constant as selected from the calendar.
-
Sampling period
-
Fixed Sampling period: The data interval will be divided equally by the number of rows selected. (Start time equates to the value of row 1, and End time equates to the value of the last row.) The calculated sample period is listed, and values will be taken at this point.
-
Variable Sampling period: The total data interval will be divided equally by the number of rows selected, creating equal sampling interval ranges. (Start time equates to the value of row 1, and End time equates to the value of the last row.) This calculated sample range is listed, and values will be taken randomly at any time within this sample range.
Step 4: Execute
The random values are generated and the new dataset will populate the workspace.
Related topics: